Gene structure conservation aids similarity based gene prediction.
نویسندگان
چکیده
One of the primary tasks in deciphering the functional contents of a newly sequenced genome is the identification of its protein coding genes. Existing computational methods for gene prediction include ab initio methods which use the DNA sequence itself as the only source of information, comparative methods using multiple genomic sequences, and similarity based methods which employ the cDNA or protein sequences of related genes to aid the gene prediction. We present here an algorithm implemented in a computer program called Projector which combines comparative and similarity approaches. Projector employs similarity information at the genomic DNA level by directly using known genes annotated on one DNA sequence to predict the corresponding related genes on another DNA sequence. It therefore makes explicit use of the conservation of the exon-intron structure between two related genes in addition to the similarity of their encoded amino acid sequences. We evaluate the performance of Projector by comparing it with the program Genewise on a test set of 491 pairs of independently confirmed mouse and human genes. It is more accurate than Genewise for genes whose proteins are <80% identical, and is suitable for use in a combined gene prediction system where other methods identify well conserved and non-conserved genes, and pseudogenes.
منابع مشابه
Phylogenetic analysis of HSP70 gene of Aspergillus fumigatus reveals conservation intra-species and divergence inter-species
Aspergillus fumigatus is a saprophyte fungus, widely spread in a variety of ecologicalniches and the most prevalent aspergilli responsible for human and animal invasiveaspergillosis. The first step to develop novel and efficient therapies is the identificationand understanding of the key tolerance and virulence factors of pathogens. The mainfocus of the present study is to perform the similarit...
متن کاملDifferences in Genetic Structure among Fagus orientalis Lipsky (Oriental Beech) Populations under Different Management Conditions: Implications for in situ Gene Conservation
Resource sustainability requires a thorough understanding of the influence of forest management programs on the conservation of genetic diversity in tree populations. To observe how differences in forest management affect the genetic structure of Fagus orientalis Lipsky (oriental beech), we evaluated thirteen beech sites across Hyrcanian forests, based on six microsatellite loci. Significant di...
متن کاملPrediction of 3D protein Structure based on Mutation of AKAP3 and PLOD3 Gene in Case of Non-Obstructive Azoospermia
Background: The present study has been designed with the aim of evaluating A-kinase anchoring proteins 3 (AKAP3)and Procollagen-Lysine, 2-Oxoglutarate 5-Dioxygenase 3 (PLOD3) gene mutations and prediction of 3D proteinstructure for ligand binding activity in the cases of non-obstructive azoospermic male.Materials and Methods: Clinically diagnosed cases of non-obstructive azoos...
متن کاملMolecular characterization of the lipL41 gene of Leptospira interrogans vaccinal serovars in Iran
Leptospirosis caused by infection with pathogenic leptospires, which is the most prevalent zoonotic disease in the world. The outer membrane proteins (OMPs) of pathogenic leptospires such as LipL41 play a crucial role in pathogenesis of this disease. Therefore a major challenge to develop an effective vaccine against leptospirosis is application of basic research on the OMPs of leptospires to i...
متن کاملGenetic characterization of great sturgeon brood stocks- recommendations for the conservation management and aquaculture
The great sturgeon, Huso huso, is one of the most important cultured sturgeon species in Iran, which effective management of aquaculture production of this species requires knowledge of broodstock structure, mating patterns, and genetic diversity of broodstock. The aim of the present study was the application of microsatellite DNA analysis for genetic diversity assessment in the first generatio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Nucleic acids research
دوره 32 2 شماره
صفحات -
تاریخ انتشار 2004